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EXPRESSION OF G-CSF AND MUTEINS THEREOF AND THEIR USE 

The Invention concerns the field of recombinant DNA. In 
particular, the Invention concerns the Isolation of nucleic acid 
sequences encoding granulocyte colony-stimulating factor (G-CSF) and 
5 the expression of G-CSF 1n various host cells. The Invention concerns 
modified DNA sequences encoding G-CSF and mutelns thereof that are 
efficiently expressed under the control of promoters operable In £. 
coli, expression vectors utilizing the modified DNA sequences for the 
efficient expression of G-CSF 1n E. coli and Gram-negative hosts known 
10 to exchange genetic material with E. co11» host organisms Including 
microorganisms transformed with the expression vectors and G-CSF 
produced thereby. 

Interest in the growth and differentiation of granulocytes 
and macrophages from bone marrow progenitors has prompted the study of 

15 the colony-stimulating factors (CSFs) that regulate these processes. 
The subject has been recently reviewed by D. Metcalf in Blood, 67:257 
(1986), One of these factors, termed granulocyte-CSF (G-CSF) 
stimulates the in vitro growth of predominantly granulocyte colonies 
from bone marrow stem cells. The availability of cell lines that 

20 produce CSFs has been Instrumental in the characterization of G-CSF; 
however, proper interpretation of this work requires the precise 
identification of the CSF released by these cells. One such cell 
line, the MIA PaCa-2, derived from a human pancreatic carcinoma, 
reportedly produces both the macrophage-speclfic CSF-1 and a CSF that 

25 can Induce the formation of primarily granulocyte colonies in seven 
day bone marrow cell cultures. Wu, M. C, J. Clin. Invest., 67; 1588 
(1981). A CSF-1 cDNA clone has been recently isolated from this cell 
line, Kawasaki et al«, Science, 230: 291 (1985). They also showed, by 
translation of sucrose gradient fractionated mRNA In Xenopus laevis 

30 oocytes, that a transcript that sedimented slower than the 18s marker 
encoded a CSF that Induced the proliferation of murine bone marrow 
cells; however, the CSF-1 clone hybridized most strongly to mRNA In 
the fractions that sedimented faster than the 18s marker. 
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Human G-CSF causes the differentiation of granulocytes and 
monocytes from bone marrow progenitor cells. It also Induces some 
leukemia cells to differentiate Into more mature cell types. Thus, 
human G-CSF may be useful 1n regeneration of Important blood cell 
5 populations 1n Immunologically compromised patients and patients with 
anemic disorders or Infections. G-CSF may also be useful in 
Inhibiting leukemia by causing leukemia cells to differentiate to a 
non-dividing state, thereby checking the uncontrolled proliferation of 
these cell types. Myeloid leukemia cells are known to have receptors 
10 for G-CSF, for example. Science, 323: 7 (1986). 

Souza et al • have isolated, cloned, and expressed a gene 
encoding G-CSF, which has no significant homology to GM-CSF, in an 
unidentified strain of E. coll using a modified temperature sensitive 
runaway plasmid with a x phage PL promoter and synthetic DNA fragment 
15 containing an Initiation codon, followed by the sequence encoding the 
mature form of unaltered G-CSF. No data on expression levels are, 
however, disclosed. Souza et al., Science, 232:61-65 (1986). 

In addition, a colony stimulating factor having the same N- 
terminal 20 amino acids as that of Souza et al . was isolated from a 

20 cultured human oral carcinoma as disclosed 1n European Patent 
Publication 0169,566 to Nagata et al. This clone designated CHU-2, 
has an additional sequence encoding Val-Ser-Glu just prior to the 
cy s 36 residue of the protein encoded by the DNA sequence cloned from 
MIA PaCa-2, as described herein below. No information Is given in 

25 either of the above-mentioned disclosures as to the levels of 
expression in recombinant hosts, particulaly E. coli. Nagata et al. 
subsequently reported a cDNA sequence that lacks the sequence encoding 
the above-mentioned additional tri-peptide sequence. Nagata et al., 
EMBO Journal 5:3, 575-581 (1986). 

30 Reportedly, both of these recombinantly produced G-CSFs have 

NH 2 terminal methionines as a result of expression in the recombinant 
bacterial host E. coll. Such N^-terminal methionines are generally 
known to be immunogenic when the protein is administered parenterally 
to a mammalian host. 
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A number of factors are Involved In obtaining maximal 
expression of a protein 1n a recombinant host. Among these factors 
are optimization of the physiological conditions for growth of the 
host cell by changes to the culture media in which the host grows. 
5 Also Important is the selection and use of strong promoter and 
ribosome binding sites (RBS) that are operable 1n the expression 
vector to be used. The elimination of attenuator regions if they 
occur in the region of regulated promoters such as the tryptophan 
operon 1s also desirable. 

10 A more subtle factor 1n maximizing protein expression In a 

recombinant host relates to codon utilization. In brief, codon 
utilization may be described as the ability of a particular 
recombinant host organism to translate some codons encoding a 
particular amino acid more quickly and accurately than other different 

15 codons encoding the same amino add. Thus, although the trinucleotide 
code specifying a particular amino acid 1s degenerate, i.e., more than 
one trinucleotide sequence can encode a particular amino acid, all 
codons encoding a particular amino acid are not utilized by the host 
with equal efficiency. Enhanced expression may be obtained by 

20 changing the nucleotide sequence of a gene in a recombinant host to 
the host preferred codons without changing the amino acid sequence of 
the protein encoded thereby. Additionally, changes in the amino acid 
sequence of the NH 2 terminus may effect processing of the N^-termlnal 
methionine mentioned above. 

25 Another subtlety in the enhancement of gene expression in an 

expression vector in a recombinant host relates to secondary structure 
formation in the mRNA sequence to be expressed. It has been shown, 
for example, that certain "stem and loop" secondary structures placed 
in proximity to the 3 1 terminus of a gene, lead to Increased stability 

30 of the mRNA transcript and Increased expression of the protein encoded 
thereby, Wong et al., Proc. Nat. Acad. Scl. (USA), 83:10, 32-33 
(ig86). However, 1t has been suggested that formation of stable 
secondary structures at the 5' end of an mRNA transcript may block 
efficient translation thereof, leading to reduced yield of the protein 

35 encoded by the gene. The Influence of mRNA secondary structure on 
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translation has been discussed 1n Iserentant & Flers, Gene. 9:1-12 
(1980) and Buell et al. t Nucleic Add Research. 13; 1923-1938 (1985). 
DeLamarter, J. F# f et al., EHBO Journal . 4,10 :2575-2581 (1985) have 
reported that the yield of GM-CSF has been Increased In E. coll by 
5 means of altering third position deoxyrlbocytoslne to 
deoxyrlboadenlnes 1n the first ten of the 5 1 codons of the DNA 
encoding mature GM-CSF. and attribute the Increase In protein 
synthesis to Jhe altered 5' end of the mRNA transcript- 
It would be desirable to obtain large amounts of G-CSF for 
10 development of therapies utilizing the granulocyte stimulating and 
granulocyte-macrophage stimulating characteristics of this colony 
stimulating factor. Furthermore, it would be desirable to obtain 
forms of G-CSF that lack NH 2 -terminal methionine when expressed 1n the 
recombinant host, and particularly in E. coli. 

15 The present invention makes available G-CSF in large 

quantities by producing It at high levels in a number of hosts. 

The Invention discloses that a previously isolated mRNA 
fraction from MIA PaCa-2 cells having high bone marrow proliferative 
activity encodes G-CSF. The invention encompasses the G-CSF mRNA. In 
20 addition, the Invention encompasses a cDNA sequence made from this 
mRNA fraction that encodes the G-CSF gene. 

The Invention also Includes DNA sequences encoding mature G- 
CSF; these DNA sequences have been altered to facilitate expression of 
G-CSF in recombinant hosts. In addition, DNA sequence encoding mature 
25 • G^CSF or a mutein thereof lacking NH 2 -terminal methionine when 
expressed in a recombinant host, are disclosed. 

At present, Investigators have encountered great difficulty 
in obtaining significant expression of mature G-CSF in E* coli hosts 
under Pl promoter gene N-RBS control or tryptophan (Trp) promoter 
30 control when the unaltered nucleotide sequence of the mature G-CSF 1s 
used. The Inventors have determined that, when under P L gene N-RBS 
control 1n E ; coll. steady state levels of mRNA encoding G-CSF are 
detectable by Northern analysis as described further herein. Under 
the control of the Trp promoter, no steacty state G-CSF mRNA is found 
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in E. coli by the same method. Surprisingly, 1n light of the work of 
Delemarter, et al. which suggests that alterations of the 5' end of 
the GM-CSF gene affects secondary structure of the GM-CSF mRNA 
sequence and the translation thereof, the Inventors have found that 
5 alterations 1n the 5' region of the 6-CSF gene leads not only ta 
Increased expression of G-CSF protein 1n both P L gene N-RBS and Trp 
controlled expression vectors 1n E. coll , it also leads to Increased 
levels of mRNA steady state transcript in Trp controlled expression 
vectors 1n E. colt. 

10 Figure 1A is a plot of bone marrow proliferation activity 

obtained for Xenopus oocyte supematants made from the fractionated 
MIA PaCa-2 mRNA. 

Figure IB is a dot blot of MIA PaCa-2 mRNA fractions with a 
CSF-l-specific probe. 

15 Figure 2 is a Northern blot of the RNA obtained from MIA 

PaCa-2 cells (lane 1), LD-1 cells (lane 2), and 5637 cells (lane 3), 
probed with two different 24-mer probes. In 2A the probe was 5 1 - 
ATGGCTGGACCTGCCACCCACAGC-3' f which hybridized to RNA of all three 
lanes. In 2B the oligomer probe was 5 1 -AGAAGCT GGTGAGT6A6T GTGCCA-3 1 . 

20 The nine nucleotides underlined appear only 1n the CHU-2 sequence that 
purports -to be G-CSF (Nagata et al., supra) . This probe did not 
hybridize to any of the cell lines tested under stringent conditions. 

Figure 3A is a plot of the bone marrow proliferation 
activity of the supernatants collected from Xenopus laevls oocytes 40 
25 hours after injection of the pooled MIA PaCa-2 RNA fractions. 
Activity was measured by the murine bone marrow cell proliferation 
assay described in the examples below. 

Figure 3B is a Northern blot of the pooled MIA PaCa-2 mRNA 
fractions using a G-CSF specific y 32 P labeled oligodeoxyrlbonucleotide 
30 probe. 

Figure 4 shows the complete sequence of G-CSF cDNA obtained 
from the sequencing of the BamH I digest of plasmid pP12 and deduced 
amino acid sequence encoded thereby. The arrow indicates the position 
of the nine nucleotide sequence GTGATGGAG of the Nagata et al. clone. 
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Figure 5 1s a schematic Illustration showing the prepartlon 
of plasmld pPDl and the placement of a Hindi 1 1 restriction site and 
ATG codon 1n the G-CSF sequence* 

Figure 6 1s a schematic Illustration of the prepartlon of 
5 plasmld pPD2 1n which the 3* untranslated region of the G-CSF gene 1s 
excised and transcription termination and mRNA stability are placed 
under control of the positive retroregulatory element. 

Figure 7 is a schematic Illustration of the preparation of 
plasmid pJDl 1n which G-CSF expression 1s under the control of the P L 
10 promoter and gene N-ribosome binding site. 

Figure 8 1s a schematic illustration of the preparation of 
pJD4, pJD4A and pJD4B. 

Figure 9 1s a schematic Illustration of the preparation of 

pPD5. 

15 Figure 10 is a Northern blot of mRNA from pJDl and pPD2 

probed with a G-CSF specific ollgodeoxy nucleotide probe. 

Figure 11 shows the ON A sequence and N- terminal amino add 
sequence encoded by native G-CSF, pJD4B, pPD5 and pJD4A. The ATG 
codon encoding Met was added to native G-CSF and does not occur in the 
20 cDNA sequence obtained from MIA PaCa-2 transcripts. 

Figure 12 is an SDS gel showing expression of G-CSF in 
induced and uninduced E. coli host carrying pJD4A and pJD4B. 

Figure 13 is an SDS gel showing expression of G-CSF in 
induced and uninduced E. coll host carrying pPD5. 

25 Figure 14 1s a schematic illustration of the preparation of 

plasmld pPD6 in which yj-CSF expression is under control of the P L 
promoter and gene N-ribosome binding site. 

Figure 15 is an SDS gel showing expression of mG-CSF and Vj- 
mG-CSF. The Vj-mG-CSF mobility reflects a slightly smaller size 
30 consistant with deletion of NH 2 - terminal Met and Thr. 
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A, 6-CSF Cloning and Expression 

The G-CSF of the present Invention was detected 1n a 
purified mRNA fraction obtained by sucrose density gradient 
centrlfugatlon of Induced MIA PaCa-2 cells as described 1n PCT 
5 Publication No. W086/04607 published August 14 , 1986 and assigned to 
the assignee of the present Invention. 

mRNA fractions obtained from the gradient were Injected Into 
Xenopus laeyls oocytes; the oocytes were Incubated, and the 
supernatants tested for the ability to stimulate murine bone marrow 

10 proliferation (bmp). Peak bmp activity was found in supernatants from 
oocytes that had been Injected with the 14, 15 and 16$ MIA PaCa-2 mRNA 
fractions although peak hybridization to a CSF-1 specific probe was 
obtained with the 18S mRNA fraction. This phenominon 1s Illustrated 
1n Figure 1A which shows a plot of bmp activity and CSF hybridization 

15 intensity per mRNA fraction. 

Further, Investigation by the present inventors has 
determined that the peak bmp activity associated with the slower 
sedimenting RNA fractions is due to the activity of G-CSF as is shown 
in Figure 3A and 3B. These figures show that peak hybridization of a 
20 G-CSF specific probe with MIA PaCa-2 mRNA fractions sedimenting slower 
than 18S. Thus, the present Invention encompasses an mRNA fraction of 
human cells that Is capable of producing G-CSF in a host cell. 

"G-CSF" as used herein means a protein having the effect of 
stimulating the production of primarily granulocyte colonies or 
25 granulocyte-macrophage colonies in a colony forming ass^y using bone 
marrow cell progenitors of an appropriate species. A protein having 
this activity has the deduced amino acid sequence shown in Figure 4 
herein and is considered to be within the scope of the invention. 

G-CSF according to the invention may be isolated from the 
30 MIA PaCa-2 cell line. In addition, as shown in Figure 2, G-CSF 
specific RNA sequences are clearly detectable in the mRNA of LD-1 
cells and 5637 cells. Thus, G-CSF obtained by cloning from these cell 
lines as well as any other cell line which is G-CSF probe positive, 1s 
considered within the scope of the Invention. Mature G-CSF produced 
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by the LD-1 cell line has been Isolated and amino-acid sequenced. 65% 
of the LD-1 produced protein has a thr residue at the NH 2 -term1nus and 
35% has a pro residue at the NH 2 - terminus. 

As 1s the case for all proteins, the precise chemical 

5 structure of G-CSF depends on a number of factors. As ionlzable amino 
and carboxyl groups are present 1n the molecule, a particular protein 
may be obtained as an acidic or basic salt, or 1n neutral form. All 
such prepartlons which retain their activity when placed 1n suitable 
environmental conditions are Included In the definition. Further, the 

10 primary amino acid sequence may be augmented by derlvatization using 
sugar moieties {glycosylate ) or by other supplementary molecules 
such as lipids, phosphate, acetyl groups and the like, more commonly 
by conjugation with saccharides. The primary amino add structure may 
also aggregate to form complexes. Certain aspects of such 

15 augmentation are accomplished through post-translatlonal processing 
systems of the producing host; other such modification may be 
Introduced in vitro. In any event, such modifications are included in 
the definition so long as the activity of the protein, as defined 
above, 1s not destroyed. It 1s expected, of course, that such 

20 modifications may quantitatively or qualitatively affect the activity, 
either by enhancing or diminishing the activity of the protein in the 
various assays. 

Further, individual amino add residues in the chain may be 
modified by oxidation, reduction, or other derlvatization, and the 
25 protein may be cleaved to obtain fragments which retain activity. 
Such alterations which do not destroy activity do not remove the 
protein sequence from the definition. 

Modifications to the primary structure itself by deletion, 
addition, or alteration of the amino acids incorporated into the 
30 sequence during translation can be made without destroying the 
activity of the protein. Such substitutions or other alterations 
result in proteins having an amino acid sequence which falls within 
the definition of proteins "having an amino acid sequence 
substantially equivalent to that of G-CSF*. 
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For convenience, the mature G-CSF protein amino acid 
sequence shown in Figure 4, deduced from the cDNA clone illustrated 
herein, is designated mG-CSF (mature G-CSF) beginning at the amino 
acid residue threonine designated +1. Figure 4 shows the presence of 

5 a 30 residue putative signal sequence, which is presumably cleaved 
upon secretion from mammalian cells; mG-CSF is represented by amino 
acids 1-174 shown in that figure. Specifically included in the 
definition of human G-CSF are muteins which monomers and dimers, if 
any, are G-CSF and related forms of G-CSF, designated by their 

10 differences from mG-CSF • G-CSF derived from other species may fit the 
definition of "human" G-CSF by virtue of its display of the requisite 
pattern of activity as set forth above with regard to human substrate. 

Also for convenience, the amino acid sequence of G-CSF will 
be used as a reference and other sequences which are substantially 

15 equivalent to this 1n terms of G-CSF activity will be designated by 
referring to the sequence shown in Figure 4. The substitution of a 
particular amino acid will be noted by reference to the number of the 
amino acid residue which it replaced. Thus, for example, sergg G-CSF 
refers to the protein which has the sequence shown in Figure 4 except 

20 that the amino acid at position 60 is serine rather than proline. 
Deletions are noted by a v followed by the numer of amino acids 
deleted from the N-terminal sequence, or by the number of amino acids 
remaining when residues are deleted from the C-terminal sequence, when 
the number is followed by a minus sign. Thus, v^-G-CSF refers to 6- 

25 CSF of Figure 4 wherein the first four amino acids from the N-termlnus 
have been deleted; v 130 - refers to G-CSF wherein the last 44 amino 
acids following amino acid 130 have been deleted. 

"Operably linked" or "1n operable linkage" refers to 
juxtaposition such that the normal function of the components can be 
30 performed. Thus, a coding sequence "operably linked" to control 
sequences refers to a configuration wherein the coding sequence can be 
expressed under the control of these sequences. 

"Control sequences" refers to DMA sequences necessary for 
the expresion of an operably linked coding sequence in a particular 
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host organism. The. control sequences which are suitable for 
procaryotes, for example, include a promoter, optionally an operator 
sequence, a RBS, and possibly, other as yet poorly understood, 
sequences, Eucaryotic cells are known to utilize promoters,, 
polyadenylation signals, and enhancers. 

"Expression system" refers to DNA sequences containing a 
desired coding sequence and control sequences in operable linkage, so 
that hosts transformed with these sequences are capable of producing 
the encoded proteins. In order to effect transformation, the 
expression system may be included on a vector; however, the relevant 
DNA may then also be integrated into the host chromosome. 

As used herein "cell", "cell line", and "cell culture" are 
used interchangeable and all such designations. Include progeny. Thus 
"transformants" or "transformed, cells" includes the primary subject 
cell and cultures derived therefrom without regard for the number of 
transfers. It is also understood that all progeny may not be 
precisely identical in DNA content, due to deliberate or inadvertent 
mutations- Mutant, progeny which have the same functionality as 
screened for in the originally transformed cell, are included. Where 
distinct designation's are intended, it will be clear from the context. 

Control Sequences and Corresponding Hosts 

Procaryotes most frequently are represented by various 
strains of E. coli. However* other microbial, strains may also be 
used, such as bacilli, for example Bacillus subtil is, various species 
of Pseudomonas, or other bacterial strains. In such procaryotic 
systems, pi asmid vectors which contain replication sites and control 
sequences derived from a species compatible with the host are used. 
For example, £. coli is typically transformed using derivatives of 
pBR322, a pi asmid derived from an E. coli species by Bolivar, et al., 
Gene, 2^:95 (1977). pBR322 contains genes for ampictllin and 
tetracycline resistance, and thus provides additional markers which 
can be either retained or destroyed in constructing the desired 
vector. Commonly used procaryotic control sequences which are defined 
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therein to Include promoters for transcription Initiation, optionally 
with an operator, along with RBS sequences, Include such commonly used 
promoters as the beta-lactamase (penicillinase) and lactose (lac) 
promoter systems (Chang, et al., Mature (1977) 198:1056) and the 
tryptophan (Trp) promoter system (Goeddel, et al., Nuclei c Acids Res. 
(1980) 8:4057) and the lambda derived P L promoter and N-gene Hbosome 
binding site (SMmatake, et al., Nature (1981) 292:128), which has 
been made useful as a portable control cassette, as set forth 1n PCT 
Publication No. W085/03522 published August 15, 1985. However, any 
available promoter system compatible with procaryotes can be used. 

As described 1n detail herelnbelow, the native G-CSF 
sequence has not proved to be effective for the production of G-CSF 1n 
such prokaryotic microbial hosts at high levels. In the present 
invention, alterations have been carried out to the 5' end of the mG- 
CSF coding sequence. In addition to placing an ATG start codon at the 
5' end of the DNA sequence encoding mG-CSF, a number of changes have 
been made in the nucleotide sequence thereof without changing the 
amino add sequence of the mG-CSF protein encoded thereby. 

These nucleotide changes encompass codons primarily encoding 
the first 10 amino acids of mG-CSF. At least one nucleotide In the 
third position of at least one of these codons has been changed from a 
deoxyriboguanidine or deoxyribocytoslne to a deoxyriboadenlne. These 
changes are made without altering the amino acid residue encoded by 
the codon, and furthermore, the resulting altered codon 1s one which 
1s not necessarily host-preferred, at least when the host cell is 
coli. More preferably, the third position codon alteration 1s carried 
out on at least one of the first 10 or 11 codons of mG-CSF wherein the 
5 1 amino acid sequence of mG-CSF remains 
ThrProLeuGlyProAl aSerSerLeuPro or MetThrProLeuGlyProAl aSerSerLeuPro, 
when the additional codon encoding an N- terminal Met is Included. 

More preferably, the above-described G to A or C to A codon 
alterations are carried out on at least one of the first 4 or 5 codons 
of mG-CSF wherein the 5' amino add sequence of mG-CSF remains 
ThrProLeuGly or MetThrProLeuGly when the additional codon encoding an 
N-termlnal methionine is Included. 
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In a more preferred embodiment, the above-mentioned changes 
. to the third position nucleotide of codons encoding the 5' end of mG- 
CSF are carried out to at least three of the 10 codons of the DNA 
sequence encoding the amino acid sequence 

5 ThrProLeuGlyProAl aSerSerLeuPro. 

In an even more preferred embodiment, the above-mentioned 
changes to the third position nucleotide of codons encoding the 5 1 end 
of mG-CSF are carried out to at least three of the 4 codons of the DNA 
sequence encoding the amino acid sequence ThrProLeuGly. 

10 When the above-mentioned changes are made in the 5 1 end of 

the DMA sequence encoding mG-CSF that 1s 1n an E. coll compatible 
expression vector in operable linkage with the Trp promoter, both 
stead(y state mRNA transcripts encoding mG-CSF and a protein fraction 
having mG-CSF activity are readily detectable by Northern blotting and 

15 Xenopus laevis oocyte assay, bone marrow proliferation, or granulocyte 
colony stimulation assays, respectively. By contrast, the native mG- 
CSF DNA sequence in otherwise operable linkage with the Trp promoter 
yields neither mRNA transcript nor detectable by Northern blot nor a 
protein fraction having G-CSF activity in the above-mentioned assays. 

20 The native G-CSF DNA sequence in an expression vector 1n 

operable linkage with the P L promoter gene N-RBS, yields detectable 
steacjy state mRNA transcripts encoding G-CSF, but no protein fraction 
having mG-CSF. However, when the altered mG-CSF DNA sequence 
according to the invention 1s placed 1n the same vector under the same 

25 promoter RBS control, a protein fraction is produced having mG-CSF 
activity. 

The invention also concerns muteins of G-CSF in which the N- 
terminal methionine is not present when G-CSF Is produced from a 
recombinant host. In prokaryotic recombinant hosts such as E. coll. 
30 the translation initiation codon ATG encodes the amino acid 
methionine. As a result, the amino or NH 2 - terminus of recombinant 
eukaryotic proteins produced in these recombinant hosts has a 
methionine residue that is generally not found in the protein when 
produced 1n the eukaryotic cell from which the DNA was initially 
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obtained. This superfluous NH 2 -term1nal methionine, however, can be 
processed and removed by certain exopeptidase enzymes, most 
particularly methionine amino peptidase or met amino peptidase. This 
enzyme although capable of removing NH 2 -terminal amino acid residues, 
5 specifically does not remove the NH 2 - terminal methionine with equal 
efficiency from all proteins. In particular, the efficiency of the 
enzyme varies depending upon the identity of the amino acid residue 
adjacent to the NH 2 -terminal methionine. 

The precise specificity of the enzyme in this regard cannot 
10 be determined without testing nearly an infinite number of substrates; 
however, a useful rule 'of thumb is set forth by Sherman, F. , et al., 
Bio Assays 3:27-31 (1985). Sherman et al. based their considerations 
for specificity of Met-amino-peptidases on the observed forms of 
mutants of iso-l-cytochrome-C from yeast and the published primary 
15 sequence of 82 mature intracellular proteins. They conclude that 
methionine is usually cleaved from residues with a side chain having a 
radius of gyration of 1.29 A or less, but generally not cleaved from 
residues with side chains larger than 1.43 A. This is consistent with 
the observation that mutationally altered 1so-l-cytochrome-C taken in 
20 consideration with other published sequences of other proteins from 
procaryotic and eucaryotic systems Indicate that N-terrainal methionine 
is cleaved when it precedes residues of alanine, cysteine, glycine, 
proline, serine", threonine, or valine, but not when 1t precedes 
residues of arginine, asparagine, aspartic acid, glutamine, glutamic 
25 acid, Isoleucine, leucine, lysine, or methionine. These results are 
generally consistent with those set forth in the illustrations 
below. However, some exceptions occur where the radius of gyration 
for the second amino acid is higher than 1.29 A and the secondary and 
tertiary structure or other conditions are particularly favorable. 
30 Therefore, this aspect of the specificity of the enzyme is intended as 
a general guideline and 1t should be borne 1n mind that even the 
specificity of particular amino peptidase have not been determined 
with exact precision, I.e., not all possible tertiary structures have 
been tested. Therefore, 1n order to fall within the definition of 
35 "Met-aminopept1dase", the enzyme needs only to meet the requirement of 
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specific cleavage of N- terminal methionine without cleavage of 
Internal methionine residues and without cleavage of N-term1nal 
residues other than methionine from any peptide* 

Among the N^- terminal amino add sequences that may be 
5 processed by amino terminal methionine peptidase, are those 1n which 
the methionine is followed by ala, gly or pro. In addition, NH 2 - 
terminal methionine followed by ala-met, gly-met, ala-ser, gly-gly, 
ala-pro, or pro-thr is expected to be processed. 

Thus, any of the form of G-CSF or muteins of G-CSF in which 

10 the above-mentioned sequences comprise the N-terminal portion of the 
amino acid will fall within the definition of G-CSF according to the 
invention so long as the biological activity of G-CSF is maintained. 
In particular, a deletion mutant of G-CSF, 1n which the threonine 
residue following the MH 2 - terminal methionine 1s deleted to yield a 

15 v^-G-CSF 1s Illustrated in some detail hereinbelow. Specifically 
mutated forms of G-CSF in which the thrj residue is substituted with 
ala, or gly, ala-ser, gly-gly, or in which the order of pro and thr 
are reversed will also fall within the scope of the invention. DNA 
species encoding the above-indicated muteins of G-CSF may be made by 

20 site specific mutagenesis of the cDNA encoding native G-CSF using 
known techniques. Oligonucleotides encoding these muteins may also be 
substituted for DMA sequences encoding the native G-CSF amino terminal 
amino add sequence through ligation of o!1godeoxyr1bonucleot1des 
encoding the altered sequenced Into specifically digested G-CSF DNA 

25 sequences. 

B. General Methods for Carrying out the Invention 

Transformations 

Depending on the host cell used, transformation 1s done 
using standard techniques appropriate to such cells. The calcium 
30 treatment employing calcium chloride, as described by Cohen, S. N., 
Proc. Natl. Acad. Sc1» (USA) (1972) 69:2110, or the RbCl 2 method 
described 1n Maniatis, et al., Molecular Cloning; A Laboratory Manual, 
(1982) Cold Spring Harbor Press, p. 254 was used for procaryotes or 



15 



0256843 



other cells which contain substantial cell wall barriers. For 
mammalian cells without such cell walls, the calcium phosphate 
precipitation method of Graham and van der Eb, Virology (1978) 52:546 
may be used or the method of Wang, et al.. Science, 228:149 (1985). 
5 Transformations into yeast are carried out according to the method of 
Van Solingen, P., et al., J. Bact. (1977) .130:946 and Hsiao, C. L., et 
al ., Proc. Natl. Acad. Sci. (USA) (1979 ) 76:3829. 

Screening cDNA Libraries 

cDNA libraries are screened using the colony hybridization 

10 procedure. Lifts of colonies are made onto nitrocellulose filter 
papers (S & S type BA-85). The colonies are lysed and DMA fixed to 
the filter by treatment for five minutes with 0.5 M NaOH, 1.5 M NaCl , 
and are then washed twice for five minutes each time with 1.0 M Tris 
pH 8, 3 M NaCl. Filters are air dried and baked at 80°C for two 

15 hours. The duplicate filters are prehybridized at 45-50°C for one 
hour in 5 x SSC, 10 x Denhardt's solution (0.21 polyvinylpyrrolidone, 
0.2% Ficoll, 0.2% BSA), 0.1% SDS, 50 mM sodium phosphate pH 7.0, and 
100 jig/ml tRNA. 

Hybridization of the filters is done in a solution similar 
20 to that described above for prehybridization, but also contains 10% 
dextran sulfate with kinased probe at 1 x 10 6 CPH/ml under conditions 
which depend on the stringency desired. Typical moderately stringent 
conditions employ temperatures of 45-50°C for 16-20 hours with 1-5 
ml/fHter of DNA hybridization buffer containing probe. For higher 
25 stringencies higher temperatures are employed. The filters are washed 
three times for 15 minutes each time at appropriate temperatures using 
3 x SSC, 0.1% SDS, air dried, and are autoradiographed at -70°C for 
two to three days. 

Vector Construction 

30 Construction of suitable vectors containing the desired 

coding and control sequences employs standard ligation and restriction 
techniques which are well understood in the art. Isolated plasmids, 
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DNA sequences, or synthesized oligonucleotides are cleaved, tailored, 
and rellgated in the form desired. 

Site specific DNA cleavage 1s performed by treating with the 
suitable restriction enzyme (or enzymes) under conditions which are 
5 generally understood 1n the art, and the particulars of which are 
specified by the manufacturer of these commerdaly available 
restriction enzymes. See, e.g., New England Blolabs, Product 
Catalog. In general, about 1 pg of plasmid or DNA sequence Is cleaved 
by one unit of enzyme 1n about 20 pi of buffer solutions. In the 
10 examples herein, typically, a 3-10 fold excess of restriction enzyme 
is used to insure complete digestion of the DNA substrate. Incubation 
times of about one hour to two hours at 37 °C or other appropriate 
temperatures are workable, although variations can be tolerated. 
After each Incubation, protein is removed by extraction with 
15 phenol /chloroform, and may be followed by ether extraction, and the 
nucleic acid recovered from aqueous fractions by precipitation with 
ethanol followed by running over a Sephadex G-50 spin column. If 
desired, size separation of the cleaved fragments may be performed by 
polyacylamide gel or agarose gel electrophoresis using standard 
20 techniques. A general description of size separations Is found 1n 
Methods in Enzymology (1980) 65:499-560 or Manlatls, Molecular 
Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, Cold 
Spring Harbor, NY (1982). 

Restriction cleaved fragments may be blunt ended by treating 
25 with the large fragment of E. coll DNA polymerase I (Klenow) 1n the 
presence of the four deoxynucleotlde triphosphates (dXTPs) using 
incubation times of about 15 to 25 minutes at 20 to 25°C 1n 50 mM Tris 
pH 7.6, 50 mM NaCl, 6 mM MgCl 2 , 6 mM DTT and 5-10 yM dXTPs. The 
Klenow fragment fills In at 5' overhangs but chews back protruding 3* 
30 single strands, even though the four dXTPs are present. If desired, 
selective repair can be performed by supplying only one of the, or 
selected, dXTPs within the limitations dictated by the nature of the 
sticky ends. After treatment with Klenow, the mixture Is extracted 
with phenol /chloroform and ethanol precipitated followed by running 
35 over a Sephadex G-50 spin column. Treatment under appropriate 
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conditions with SI nuclease results 1n hydrolysis of any single- 
stranded portion. 

Synthetic oligonucleotides are prepared by the triester 
method of Matteucci, et al. ( J. Am. Chem. Soc. (1981) 103:3185) or 
5 using commercially available automated oligonucleotide synthesizers. 
Kinaslng of single strands prior to annealing or for labeling is 
achieved using an excess, e.g., approximately 10 units of 
polynucleotide kinase to 0.1 nmole substrate in the presence of 50 mM 
Tris, pH 7.6, 10 mM MgCl 2 , 5 mM dithiothreitol , 1-2 mM ATP, 1.7 pmoles 
10 T-32P-ATP (2.9 mCi/mmole), 0.1 mM spermidine, 0.1 mM EDTA. 

Ligations are performed in 15-30 w l volumes under the 
following standard conditions and temperatures: 20 mM Tris-Cl pH 7.5, 
10 mM MgCl 2 , 10 mM DTT, 33 yg/ml BSA, 10 mM-50 mM NaCl, and either 40 
V M ATP, 0.01-0.02 (Weiss) units T4 DMA Ugase at 0°C (for "sticky end" 
15 ligation) or 1 mM ATP, 0.3-0.6 (Weiss) units T4 DNA Ugase at 14°C 
(for "blunt end" ligation), lntermolecular "sticky end" ligations are 
usually performed at 33-100 pg/ml total DNA concentrations (5-100 nM 
total end concentration). lntermolecular "blunt end" ligations 
(usually employing a 10-30 fold molar excess of linkers) are performed 
20 at 1 pM total ends concentration. 

In vector construction employing "vector fragments", the 
vector fragment is commonly treated with bacterial alkaline 
phosphatase (BAP) in order to remove the 5' phosphate and prevent 
religation of the vector. BAP digestions are conducted at pH 8 in 
25 approximately 150 mM Tris, in the presence of Na + and Mg +2 using about 
1 unit of BAP per pg of vector at 60°C for about one hour. Vector 
fragments subjected to this treatment are referred to herein as 
"bapped". In order to recover the nucleic acid fragments, the 
preparation is extracted with phenol /chloroform and ethanol 
30 precipitated and desalted by application to a Sephadex G-50 spin 
column. Alternatively, religation can be prevented in vectors which 
have been double digested by additional restriction enzyme digestion 
of the unwanted fragments. 
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For portions of vectors derived from cDNA or genomic DNA 
which require sequence modifications, site specific primer directed 
. mutagenesis 1s used. This 1s conducted using a synthetic 
oligonucleotide primer complementary to a single stranded phage DNA to 

5 be mutagenlzed except for limited mismatching, representing the 
desired mutation. Briefly, the synthetic oligonucleotide 1s used as a 
primer to direct synthesis of a strand complementary to the phage, and 
the resulting double-stranded DNA 1s transformed Into a phage- 
supportlng host bacterium. Cultures of the transformed bacteria are 

10 plated in top agar, permitting plaque formation from single cells 
which harbor the phage. 

Theoretically, 50% of the new plaques will contain the phage 
having, as a single strand, the mutated form; 501 will have the 
original sequence. The resulting plaques are hybridized with klnased 

15 synthetic primer at a temperature which permits hybridizaton of an 
exact match, but at which the mismatches with the original strand are 
sufficient to prevent hybridization. Plaques which hybridize with the 
probe are then picked, cultured, and the DNA recovered. Details of 
site specific mutation procedures are described below 1n specific 

20 examples. 

Verification of Construction 

In the constructions set forth below, correct ligations for 
plasrald construction are confirmed by first transforming E. coll 
strain MM294 obtained from E. coli Genetic Stock Center, CGSC 6135, or 

25 other suitable host with the ligation mixture. Successful 
transformants are selected by amp1c1ll1n, tetracycline or other 
antibiotic resistance or using other markers depending on the mode of 
plasmld construction, as 1s understood 1n the art. Plasmlds from the 
transformants are then prepared according to the method of Clewell, D. 

30 B. , et al., Proc. Natl. Acad. Sd. (USA) (1959) 62:1159, optionally 
following chloramphenicol amplification (Clewell, D. B., J. Bacterlol. 
(1972) 110:667). The Isolated DNA 1s analyzed by restriction enzyme 
mapping and/or sequenced by the dldeoxy method of Sanger, F., et al., 
Proc. Natl. Acad. Sci. (USA) (1977) 74:5463 as further described by 
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Messing, et al., Nucleic Adds Res. (1981) 2:309, or by the method of 
Maxam, et al., Methods in Enzymology (1980) 65:499. 

Hosts Exemplified 

Host strains used in cloning and expression herein are as 

5 follows: 

For cloning and sequencing, and for expression of 
construction under control of most bacterial promoters, E. coli strain 
MM294 (supra), Talmadge, K. , et al., Gene (1980) ^12:235; Messelson, 
M., et -a*., Nature (1968) 217:1110, was used as the host. For 
10 expression under control of the P L N-RBS promoter, E. coli strain K12 
MC1000 lambda lysogen, N 7 N 53 cI857SusP 80 , ATCC 39531 (hereinafter 
sometimes referred to as MC1000-39513 XDG95 or DG95) is used. 

For M13 phage recombinants, E. coli strains susceptible to 
phage infection, such as E. coli K12 strain DG98 are employed. The 
15 DG98 strains has been deposited with ATCC July 13, 1984 and has 
accession number 1965. 

In addition to bacteria, eucaryotic microbes, such as yeast, 
may also be used as hosts. Laboratory strains of Saccharomyces 
cerevisiae , Baker's yeast, are most used although a number of other 
20 strains are commonly available. While vectors employing the 2 micron 
origin of replication are illustrated, Broach, J. R., Meth. En*. 
(1983) 101:307, other plasmid vectors suitable for yeast expression 
are known (see, for example, Stinchcomb, et al., Nature (1979) 282:39, 
Tschempe, et al. t Gene (1980) .10:157 and Clarke, L. t et al., Meth. 

25 Enz. (1983) 101:300). Control sequences for yeast vectors include 
promoters for the synthesis of glycolytic enzymes (Hess, et al., 
Adv. Enzyme Reg. (1968) JilA9; Holland, et al., Biochemistry (1978) 
17:4900). Additional promoters known in the art include the promoter 
for 3-phosphoglycerate kinase (Hitzman, et al., J* Biol. Chem. (1980) 

30 255:2073), and those for other glycolytic enzymes, such as 
glyceraldehyde-3-phosphate dehydrogenase, hexosekinase, pyruvate 
decarboxylase, phosphofructokinase, glucose-6-phosphate isomerase, 3- 
phosphoglycerate mutase, pyruvate kinase, triosphosphate isomerase, 
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phosphoglucose Isomerase, and glucoklnase. Other promoters, which 
have the additional advantage of transcription controlled by growth 
conditions are the promoter regions for alcohol dehydrogenase 2, 
Isocytochrome C, add phosphatase, degradatlve enzymes associated with 
5 nitrogen metabolism, and enzymes responsible for maltose and galactose 
utilization (Holland, ibid). Alterations to the 5' end of the mG-CSF 
OKA sequence analogous to those shown to be effective with the Trp 
promoter and P L promoter may also be carried out for use with such 
eukaryotlc promoters. 
10 It is also believed terminator sequences are desirable at 

the 3' end of the coding sequences. Such terminators are found In the 
3' untranslated region following the coding sequences In yeast-derived 
genes. Many of the vectors illustrated contain control sequences 
derived from the enolase gene containing plasmld peno46 (Holland, M. 
15 J., et al., J. Biol. Chem. (1981) 256:1385) or the LEU2 gene obtained 
from YEpl3 (Broach, J., et al., Gene (1978) 8:121), however any vector 
containing a yeast compatible promoter, origin of repldatlon and 
other control sequences Is suitable. 

It is also, of course, possible to express genes encoding 
20 polypeptides 1n eukaryotlc host cell cultures derived fro* 
multicellular organisms as described further hereinbelow. See. 
generally for exawpl e, „ Tj ssue Cul ture , Academic Press, Cruz and 
Patterson, editors (1973). Useful host cell lines Include mur1i» 
myelomas M51, VERO and HeLa cells, COS-7 cells, and Chinese hawttr 
25 ovary (CHO) cells. Expression vectors for such cells ordinarily 
include promoters and control sequences compatible with mammalian 
cells such as, for example, the commonly used early and late promoters 
from Simian Ylrus 40 (SY 40) (F1ers, et al., Nature (1978) 273:113), 
or other viral promoters such as those derived from polyoma, 
30 Adenovirus 2, bovine paplloma virus, or avian sarcoma viruses*.. or 
Immunoglobln promoters and heat shock promoters. General aspects of 
mammalian cell host system transformations have been described by 
Axel, U.S. Patent Mo. 4,399,216 Issued August 16, 1983. It now 
appears, also that "enhancer" regions are Important In optimizing 
35 expression; these are, generally, sequences found upstream of the 
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promoter region* Origins of replication may be obtained, if needed, 
from viral sources* However, integration into the chromosome is a 
common mechanism for DNA replication in eucaryotes. Plant cells are 
also now available as hosts, and control sequences compatible with 
5 plant cells such as the nopal ine synthase promoter and 
polyandenylation signal sequences (Depicker,, A., et al. t J. Hoi, 
Appl. Gen. (1982)^:561) are available. 

The Invention will be better illustrated in relationship to 
the following examples which are intended by the Inventors to be 
10 exemplary and non-1 1m1 ting. 

Example I 

Isolation and Expression of cDNA Encoding Human 6-CSF 

A cDNA clone encoding human granulocyte stimulating factor 
was isolated from the MIA PaCa-2 cell Hne, and was expressed using a 

15 recombinant vector in COS-7 cells. The MIA PaCa-2 cell line described 
1n Yunis, A* A. et al., Experimental Hematol, 12:838-843 (1984), is an 
established cell line publlcally available from the Cell Repository 
Lines (CRL) collection of the American Type Culture Collection, 12301 
Parklawn Avenue, Bethesda, MD 20895 under accession number ATCC CRL 

20 1420. The cDNA clone was sequenced and the corresponding amino acid 
sequence was deduced. 

A. Initial Detection of mRNA From MIA PaCa-2 Cells Having High B MP 
Activity, But Low Hybridization to fcsf-l ProbeT 

The human derived pancreatic carcinoma cell Hne MIA PaCa-2 
25 was used as a source of mRNA to validate CSF-1 specific probes and for 
the formation of a cDNA library containing an Intronless form of the 
human CSF-1 coding sequence. The MIA PaCa-2 cell line produces CSF-1 
at a level approximately 10 fold below that of the murine L-929 cells. 

Negative control mRNA was prepared from MIA PaCa-2 cells 

30 maintained 1n serum-free medium, I.e., under conditions wherein they 

do not produce CSF-1. Cells producing CSF-1 were obtained by 
reindudng CSF-1 production after removal of the serum. 
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Cells were grown to confluence In roller bottles using 
Dulbecco's Modified Eagles' Medium (DMEM) containing 10* fetal calf 
serum, and produce CSF-1 at 2000-6000. units/ml . The cell cultures 
were washed, and relncubated 1n serum-free condition to suppress CSF-1 
5 formation. For negative controls, no detectable CSF-1 was produced 
after a day or two of Incubation without serum. Relnduced cells were 
obtained by addition of phorbol myrlstlc acetate (100 ng/ml ) to obtain 
production after several days of 1000-2000 units/ml . 

The MIA PaCa-2 mRNA was Isolated by lysis of the cell 1n 
10 Isotonic buffer with 0.5% NP-40 In the presence of Hbonucleoslde 
vanadyl complex (Berger. S. L.. et al.. Biochemistry (1979) .18:5143) 
followed by phenol /chloroform extraction, ethanol precipitation, and 
oligo dT chromatography, and an enriched mRNA preparation obtained. 
In more detail, cells are washed twice 1n PBS (phosphate buffered 
15 saline) and are resuspended 1n IHB (140 mM NaCl, 10 mM Tr1s, 1.5 roM 
MgCl 2 . PH 8) containing 10 mM vanadyl adenosine complex (Berger, S. 
L.i et. al., supra ). 

A non-1on1c detergent of the ethylene oxide polymer type 
(NP-40) 1s added to 0.5% to lyse the cellular, but not nuclear 
20 membranes. Nuclei are removed by centrlfugatlon at 1,000 x g for 10 
minutes. The post-nuclear supernatant 1s added to two volumes of TE 
(10 mM Tr1s, 1 mM ethyl enedi ami netetraacetlc add (EDTA), pH 7.5) 
saturated phenol chloroform (1:1) and adjusted to 0.5% sodium dodecyl 
sulfate (SDS) and 10 nW EDTA. The supernatant 1s re-extracted four 
25 times and phase separated each time by centrlfugatlon at 2,000 x g for 
10 minutes. The RNA 1s predplated by adjusting the sample to 0.25 M 
NaCl, adding 2 volumes of 100% ethanol and storing at -20°C. The RNA 
is pelleted at 5,000 x g for 30 minutes. 1s washed with 70% and 100% 
ethanol. and Is then dried. Polyadenylated (polyA + ) messenger RNA 
30 (mRNA) is obtained from the total cytoplasmic RNA by chromatography on 
oligo dT cellulose (Aviv, J., et al., Proc. Natl. Acad. Sd. (1972) 
69:1408-1412). The RNA is dissolved in ETS (10 mM Trls, 1 mM EDTA, 
0.5% SDS, pH 7.5) at a concentration of 2 mg/ml. This solution is 
heated to 65°C for five minutes, then quickly chilled to 4°C. After 
35 bringing the RNA solution to room temperature, it is adjusted to 0.4 M 
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NaCl and 1s slowly passed through an oligo dT cellulose column 
previously equilibrated with binding buffer (500 mM NaCl, 10 n« Tr1s, 
1 mM EDTA, pH 7.5 0.5% SDS). The flow- through 1s passed over the 
column twice more. The column 1s then washed with 10 volumes of 
5 binding buffer. PolyA + mRNA 1s eluted with allquots of ETS, extracted 
once with TE-saturated phenol chloroform and 1s precipitated by the 
addition of NaCl to 0.2 M and 2 volumes of 100% ethanol . The RNA 1s 
repredpltated twice, 1s washed once 1n 70S and then In 1001 ethanol 
prior to drying. 

10 Total- mRNA was subjected to 5-20% by weight sucrose gradient 

centrlfugation 1n 10 mM Tr1s HC1, pH 7.4, 1 nW EDTA, and 0.5% SDS 
using a Beckman SW40 rotor at 20°C and 27,000 rpm for 17 hours. The 
mRNA fractions were then recovered from the gradient by ethanol 
precipitation, and Injected into Xenopus Tagvls oocytes in the 

15 standard translation assay. The oocyte products of the RNA fractions 
were assayed 1n the bone marrow proliferation assay of Moore, R. N. et 
al., J. Immunol. (1983) 131:2374, and of Prystowsky, M. B., et al. t 
Anw, J. Pathol. (1984) 114:149 and the fractions themselves were 
assayed by dot blot hybridization to a 32-mer probe corresponding to 

20 the DNA 1n the second exon of the genomic sequence (exon II probe). 
These results are summarized in Figure 1A and B. 

The broken Hne in Figure 1A shows the response 1n the bone 
marrow proliferation assay of the supernatants from the Xenopus laevis 
oocytes; Figure IB shows the dot blot results. The most strongly 
25 hybridizing fraction, 11, corresponds to greater than 18S, while the 
most active fractions 8 and 9 correspond to 14-16S. 

The mRNA was also fractionated on a denaturing formaldehyde 
gel, transferred to nitrocellulose, and probed with exon II probe. 
Several distinct species ranging 1n size from 1.5 kb to 4.5 kb were 
30 found, even under stringent hybridization conditions. 

B. Isolation G-CSF mRNA From MIA PaCa-2 Cells 

Confluent MIA PaCa-2 cells were stimulated 1n serum free 
Dulbecco's minimum essential medium (DMEM) for 4 days with phorbol 
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n\yr1state acetate (50 ng/ml ) and retlnolc add (10 uM). The RNA was 
prepared as described by Chlrguln et al.; briefly, the cells were 
lysed In 5 M guanldlne Isothlocyanate followed by centrlfugatlon 



In more detail, cells were lysed In a solution which 
contained 5 H guanldlne Isothlocyanate, 0.025 M Na-c1trate, pH 7, 0.5% 
sarcoslne and 8% B-mercaptoethanol . Molecular biology grade CsCl was 
10 made up to 5.7 M or 40* w/v and buffered with 0.02 M Tr1s pH 7.5 and 
0.002 M Na-EOTA. All solutions were prepared under RMase free 
conditions and passed through 0.45 y Mllllpore filters before use. 
The lysed cells were then layered onto SW28 ultracentrlfuge tubes 
which contained layers of 10 ml 5.7 M CsCl and 6 ml 40% CsCl. 

15 After centrlfugatlon at 26,000 rpm for 18 hours, the RNA 

pellet was dissolved 1n dH 2 0 and ethanol precipitated twice. 
Polyadenylated (PolyA + ) messenger RNA (mRNA) was obtained by 
chromatography of the total RNA on ollgo (dT) cellulose as described 
In Manlatls, supra (at page 197). 



20 the RNA 1s dissolved 1n sterile H 2 0 and heated to 65°C for 

five minutes. An equal volume of a solution containing 0.040 M Tr1s 
CI pH 7.6 1.0 M NaCl, 0.002 M EDTA and 0.2% SOS Is quickly added and 
the sample Is cooled to room temperature. The sample Is then' loaded 
on an ollgo (dT) column that has been equilibrated with a buffer 

25 containing 0.020 M Trls pH 7.6, 0.5 M NaCl, 0.001 M EDTA and 0.1% 
SDS. The flow through 1s collected, heated to 65°C, cooled to room 
temperature and passed over the column once more. The column 1s then 
washed with 10 volumes of wash buffer (0.02 Trls, 0.1 M NaCl, 0.001 M 
EDTA, 0.1% SDS). Poly A + mRNA Is eluted with allquots of 0.01 M Trls 

30 pH 7.5, 0.001 H EDTA and ethanol -precipitated twice. 

334 w g of MIA PaCa-2 Poly A + mRNA was fractionated to 5-25* 
by weight sucrose gradient centrlfugatlon 1n 20 mM Trls HC1, pH 7.5, 1 
mM EDTA, and 0.5% sarcoslne using a Beckman SW40 rotor at 20°C and 
27,800 rpm for 16 hours. The mRNA fractions were collected 1n 400 ul 
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fractions and ethanol precipitated twice. Fractions were pooled and 
resuspended in 15 pi ^O* 

Northern blots were prepared by the electrophoresis of 5 pg 
of poly A + RNA per lane or 1 pi of each pooled RNA fraction through 1% 

5 agarose gels containing 0.5 M formaldehyde followed by blotting onto 
nitrocellulose filters. After baking the filters for 1.5 hours at 
80°C, they were prehybridized (5 x SSC, 10 x Denhardt's solution (0.2% 
polyvinylpyrrolidone, 0.21 F1coll, 0.2S BSA), 0.1% SDS, 50 mM sodium 
phosphate pH 7.0, and 100 pg/ml tRNA) for one hour at 55°C. The blots 

10 were hybridized for 16 hours at 55°C in a similar solution that also 
contained 10% dextran sulfate and 10 5 cpm per ml of an oligonucleotide 
probe labeled with y 32 P-ATP and polynucleotide kinase. The blot shown 
in Figure 3B was washed at 55°C in 3 x SSC, 0.1% SDS. This 
oligonucleotide probe had the sequence 5 1 -GTAGGTGGCACACAGCTTCTCCTG-3 1 

15 and was designed based on the sequence of the CHU-2 cDNA clone 
described by Nagata et al. f Nature, 319:415-418 (1986). 

The RNA pools that were described in the above assay were 
translated in a Xenopus laevis oocytes assay by the Injection of 50 nl 
of of RNA Into each oocyte as described in Gurdon et al., Nature , 
20 233 :177-180 (1971). Supernatants of 10 pi per oocyte were collected 
after 40 hours and assayed for CSF activity. 

The Xenopus laevis supernatants from each hybridization 
fraction were assayed for CSF activity 1n a murine bone marrow cell 
proliferation assay as described in Moore, et al., J . Immunol . > 

25 131:2374-2378 (1983). Briefly, In this assay, 5 x 10 4 murine bone 
marrow cells per/well were incubated in 96-well plates (12 x 8) with 
serially diluted Xenopus laevis oocyte supernatants made from 
positively hybridizing mRNA fractions. After three days, 3 H thymidine 
(0.5 pCi/well) was added, and after six hours the cells were harvested 

30 and counted 1n a liquid scintillation counter. 

Peak bone marrow proliferation was found in Xenopus laevis 
oocyte supernatants made with mRNA fractions that were most strongly 
positive in Northern blots with the above-described oligonucleotide 
probe as shown in Figure 3. 
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C. Identification of G-CSF Clones In a MIA PaCa-2 cDNA Library 

A cDNA library was prepared from the enriched MIA PaCa-2 
mRNA as described In Kawasaki, et al., Science, 30:291-296 (1985). 
Briefly, the method used ollgo (dT) priming of the poly A + tails and 

5 AMV reverse transcriptase employing the method of Okayama, H. et al., 
MoT, cell Biol., 3^:280-289 (1983) This method results 1n a higher 
proportion of full length clones than does poly (dG) tailing and 
effectively uses as host vector portions of two vectors therein 
described, and readily obtainable from the authors, pcDVl and pLl. 

10 The resulting vectors contain the Insert between vector fragments 
containing proximal BamH I and Xhol restriction sites; the vector 
contains the pBR322 origin of replication, and amplcllHn resistance 
gene and SY40 control elements which result In the ability of the 
vector to effect expression of the Inserted sequences 1n COS-7 cells. 

15 A 1.2 x 10 6 clone library In E. coll obtained from the above 

enriched MIA PaCa-2 mRNA by the Okayama and Berg method was then 
probed using the same oligonucleotide probe that yielded a positive 
signal on the most active pooled MIA PaCa-2 mRNA fractions. To probe 
the library, E. coll containing the Okayama-Berg vectors were grown up 

20 on nutrient medium. Colonies were lifted onto nitrocellulose filter 
papers and were lysed. DNA was fixed to the filter by treatment for 
five minutes with O.S mM NaOH, 1.5 M NaCI. Filters were then washed 
twice for five minutes each time with 1.5 M Trls pH 8, 3 M NaCI and 
were air dried and baked at 80 °C for two hours. 

25 The filters for the screening were prehybrldlzed and 

hybridized to the y 32 P labeled probe as described above 1n Section B 
of the example, but both prehybrldlzatlon and hybridization were 
carried out at 50°C. Plasmlds pP12 and pP28 were determined to be 
probe positive and were further characterized. 

30 0. Sequencing of G-CSF MIA PaCa-2 Plasmlds cDNA 

pP12 was digested with BamH I and subcloned Into a M13mpl9 
vector and sequenced using the dldeoxy chain- termination method. 
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The DMA sequence and predicted protein sequence of pP12 are 
shown 1n Figure 4. The cDNA Insert 1n pP12 1s 1510 base pairs long 
excluding the poly (dG) and poly (dA) tails; 1t contains 11 more bases 
of 5' untranslated sequence than the CHU-2 G-CSF clone. The major 

5 difference between this clone derived from MIA PaCa-2 and CHU-2 cDNA 
clones of Nagata et al. Is a 9 base pair Insertion (GTGATGGAG) In the 
CHU-2 clone that would encode the amino acid residues Val-Ser-Glu just 
prior to cys-36 1n the MIA PaCa-2 G-CSF as indicated by the arrow In 
Figure 4. There are two other differences; an A at position 588 In 

10 the MIA PaCa-2 clone (G 1n the CHU-2 clone) 1s a silent third base 
change, and a T at position 1237 1n the MIA PaCa-2 clone (C 1n the 
CHU-2 clone) 1s 1n the 3' untranslated region. 

E. Activity of the pP12 and pP28 Proteins Produced In COS-7 Cells 

Plasmids pP12 and pP28 were purified using a CsCl gradient 
15 and COS-7 cells were transfected using a modification {Wang, A. M., et 
al., Science, 228:149 (1985)) of the calcium phosphate copredpitatlon 
technique. After incubation for three days, supernatants were 
collected and G-CSF production was assayed 1n a colony forming assay 
and the murine bone marrow proliferation assay, as described above. 

20 CSF activity was measured by both murine bone marrow cell 

proliferation and colony formation assays. For the colony assay, 1 x 
10^ mononuclear bone marrow cells from Balb/c mice were used in a 
total of 1 ml, containing 51 cell supernatant, 15% fetal calf serum, 
and 0.3% agar In RPMI 1640 medium. Colonies of greater than 40 cells 

25 were counted after seven days, and cell type determined by 
cytocentrlfuglng individual colonies and staining with a modified 
Wright stain. Each unit of CSF stimulates the formation of one 
colony. For a control, COS-7 cells were mock transfected under the 
same conditions by carrying the cells through the transfection steps 

30 without exposure to a plasmld. The results of the assay are described 
below In Table I. 
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TABLE I 

Production of CSF Activity by Transfect ed COS-7 Cells 

Colony Assay 

Trans fee Hon Proliferation (cpm) UnUs/nfl %Gl 
5 pP12 4595 >280 58 42 

pp 2 8 6594 >280 83 17 

Mock 633 0 0 0 

a = Colonies containing only granulocytes. 

b - Colonies containing both granulocytes and macrophages. 

10 EXAMPLE II 

Cloning G-CSF for Expr ession 1n E. cpl1 

A . Preparation of G-CSF Sequence: Insert ion of 5' Hindi II 
and ATti Codon "~~ 

In order to facilitate manipulation of the G-CSF DNA 
15 sequence In various cloning and expression vectors, the G-CSF sequence 
was altered by Inserting a Hlndlll site and ATG codon In mG-CSF 3* of 
the GCC codon encoding alanine (-1). 

Plasmld pP12 was digested with BamHI endonuclease and the 
digest was electrophoresed 1n a THs-acetate. 0.5% low-melt1ng point 
20 agarose gel. The small fragment band carrying the G-CSF DMA sequence 
was melted and llgated Into BamHI-digested M13rapl9 using T4 DMA Ugase 
under sticky end conditions. After ligation, the mixture was used to 
transform E^ coVi. strain DG98, ATCC accession number 39.768. The 
transformed cells were plated 1n the presence of 0.3 mM 
25 isopropylthlogalactoslde (IPTG) obtained from Sigma Chemicals (St. 
Louis, MO) and 0.3 uM X-gal on a lawn of DG98 and grown at 37°C. Non- 
alpha complementary white plaques were grown In liquid broth and a 
sample of the culture was used to purify repUcatlve form (RF) DNA. 
The presence of the Insert was confirmed by BamHI digestion and sizing 
30 the fragment on a 0.7% agarose gal. Phage having the expected Insert 
were designated pA3. 
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A Hindi 1 1 site and ATG Initiation codon were placed In the 
G-CSF sequence adjacent to and 1n frame with the codon encoding the N- 
termlnal threonine of the mG-CSF sequence by means of site-specific 
mutagenesis using a chemically synthesized purified 33-mer 
5 ollgodeoxyrlbonucleotlde having the following sequence: 

5'-GTG CAG GAA GCC AAG CTT ATG ACC CCC CTG GGC-3'. 

The Hlndlll site Immediately upstream of the ATG codon 
permits the convenient cloning of the mG-CSF-encodlng DNA sequence In 
subsequent manipulation. Approximately 10 pmoles (pM) of the 33-mer 

10 was hybridized to about 1 pM of single stranded DNA from pA3 1n about 
13 vl of 10 mM Tr1s pH 7.4, 90 mM MaCl, 10 mM MgCl 2 by heatlrtg to 85°C 
for five minutes, followed by 45°C for 20 minutes. The annealed 
mixture was chilled on 1ce and adjusted to 18 v l by the addition of 
dithlothrietol to 10 mM, each dXTP to 0.5 mM and 5 units of DNA 

15 Polymerase I Klenow fragment. The reaction mixture was Incubated on 
Ice for 20 minutes followed by Incubation at room temperature for one 
hour. The repair reaction mixture was then used to transform E. coll 
strain DG98 as above, plated onto agar plates and incubated overnight 
to obtain phage plaques. 

20 Plaques were blotted using nitrocellulose filter circles and 

the filters were treated as described to lyse the cells, denature the 
DNA, neutralize, rinse, fix the DNA to the filter and incubate In 
prehybridizatlon buffer. The 33-mer oligonucleotide was end labeled 
with y 32 P using polynucleotide kinase and hybridized to the filters at 

25 45°C overnight. Filters were washed and subjected to 
autoradiography. Probe positive plaques were grown in liquid culture 
and characterized by BamH I Hlndlll digestion and agarose gel sizing to 
confirm the presence of the expected fragments. One clone having the 
desired Insert was designated pApooL. pApooL was digested with BamH I 

30 and Hlndlll. The approximately 1500 base pair BamHI-Hlndlll fragment 
was Isolated, purified, and retained for ligation Into a plasmld 
having the desired promoter and terminator sequences. 
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B. Placement of 6-CSF Coding Sequence Under Trp Control 

To place the 6-CSF coding sequence under Trp control and 
under the Influence of the BT positive retroregul atory element (PRE), 
the G-CSF coding sequence was cloned Into plasmid pAW703. pAW703 Is a 
5 plasmid having the Trp promoter and PRE In operable linkage with a 
Hlnd lH -BamH I fragment which contains the gene encoding human tumor 
necrosis factor (TNF). 

As described below 1n detail, pAW703 1s digested with BamHI 
and Hlndlll and the BamHI and Hlndlll fragment of pApool encoding G- 
10 CSF 1s Ugated 1n place of the TNF fragment. 

The following describes the construction of pAW703 and the 
cloning of G-CSF Into this plasmid to make pPDl. An alternate 
construction using pTrp3 Is also described. 

B.l. Construction of pAW703 

15 Plasmid pAW703 was made from plasmlds pAW701 and pAW711 as 

follows. Plasmid pAW711 (ATCC 39,918) which contains a 400 bp BamHI- 
Sall fragment encoding the PRE was digested with Sail and BamHI 
endonucl eases under conditions suggested by the supplier to yield an 
approximately 400 bp fragment containing the PRE. Tht digest was 

20 loaded on a 1* agarose preparatory gel and the saall Sal I-BamHI 
fragment containing the PRE was Isolated, electroeluted and retained. 

Plasmid pAW701 was used to obtain the Trp control sequences 
for regulating expression of the gene. 

pAU701, a derivative of plasmid pTrp3. was made by digesting 
25 pTrp3 with BamH I and Hindi 1 1 endonucl eases and ligation of the large 
Hindi 1 1 -BamHI fragment with HlndlH-BamHI fragment carrying the TNF 
gene. In the construction of pAW703, pAW701 was digested with BamH I 
and Sail endonucl eases. The large BamHI -Sal I fragment of pAW701 
containing the Trp promoter Hindi 1 1 site and TNF sequence was Isolated 
30 by electroelution from a preparative agarose gel, and was Ugated to 
the small BamHI -Sal I fragment containing the PRE, under sticky end 
conditions using T4 DMA ligase to produce pAW703. 
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Both pAW703 and pApool were digested with BamH I and Hlndlll 
under conditions suggested by the manufacturer. The large Hlndlll- 
BamHI fragment from pApooL was gel Isolated and 11 gated with the 
pAW703 digest using T4 DNA Hgase. 

5 The ligation mixture was used to transform competent E. coll 

strain W294. Transformants were screened for amplclllln 
resistance. Plasmlds from a number of amplclllln resistant colonies 
were Isolated, digested with Hlndlll and BamHI and run on a 1.5% 
agarose gel with DNA sizing standards. One clone having the expected 
10 size fragments was designated pPDl. 

B.2. Trp Expression Vectors 

pTrp3 is a host expression vector containing a Trp promoter 
and RBS Immediately 5' of a Hlndlll restriction site, thus permitting 
Insertion of a coding sequence having a Hlndlll site Immediately 5' of 
15 a start codon under control of the Trp promoter. The backbone vector 
for pTrp3 Is pBR322. pTrp3 was deposited with the American Type 
Culture Collection, 12301 Parklawn Drive, Rockville, MD (ATCC) on 
December 18, 1984, and has accession number 39,946. 

B.2.A. Preparation of pTrp3 

20 To construct the host vector containing the Trp control 

sequences behind a Hlndlll site, the Trp promoter/operator/ribosome 
binding site sequence, lacking the attenuator region, was obtained 
from pVH153, obtained from C. Yanofsky, Stanford University. Trp 
sequences are available In a variety of such plasmlds known In the 

25 art. pYH153 was treated with Hhal (which cuts leaving an exposed 3' 
sticky end just 5' of the Trp promoter) blunt ended with Klenow, and 
partially digested with Tagl. The 99 bp fragment corresponding to 
restriction at the Tajgl site, 6 nucleotides preceding the ATG start 
codon of Trp leader were Isolated, and then 11 gated to EcoR I 

30 (repair )/ClaI digested, pBR322 to provide pTrp3. 

An Improved form of pTrp3 also Includes a positive 
retroregulatory sequence In a position 1n the vector downstream of the 
coding sequence to be expressed as described herelnbelow. 
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An expression vector for G-CSF having the same control 
sequences as pAW703 may be made directly from pTr P 3 (ATTC 39,940) and 
pAW711 (ATCC 39.918) and pP12 as follows. 

P Trp3 1s digested with Hindi 1 1 and Sail. The large Hindi II- 

5 Sail fragment 1s Isolated by electroelutlon from a preparative gel. 
The small BamHI-Sall fragment containing the PRE 1s Isolated from 
pAW711 as described above and Ugated to the large Hlndlll-Sall 
fragment of pTrp3 under sticky end conditions with T4 DNA Ugase. 
pPDl may also be made directly by combining the BamHI-Hlndlll fragment 

10 of pP12 with the above-mentioned fragments from pTrp3 and pAW711 and 
Ugating under sticky end conditions with T4 DNA Hgase. 

C. P lacing a BamHI Restriction Site Dow nstream of the G-CSF 
Translation I erml nation j>\ze ^ 

The DNA sequence encoding G-CSF 1n pA3 contained 
15 approximately 930 nucleotides In the transcribed but untranslated 
region 3' of the translation stop codon TGA 1n the G-CSF sequence. In 
order to remove these untranslated codons and to place transcription 
termination and mRNA stability under the Influence of the positive 
retroregulatory element, a BamHI site was placed by site-specific 
20 mutagenesis Immediately 3' of the G-CSF translation termination 
signal.' 

A chemically synthesized purified 27-mer deoxy ribonucleotide 
w was chemically synthesized using a commercial DNA synthesizer and had 
"the following sequence: 
25 % CCC AGC CCT GAG GAT CCA AGC CCT CCC 

Approximately 10 pM of the 27-mer was hybridized to about 1 
pM of single-stranded DMA from PpA3 1n about 13 pi of 10 mM Tr1s pH 
7.4, 90 mM NaCl, 10 mM MgCl 2 by heating to 85°C for five minutes, 
followed by 45°C for 20 minutes. The annealed mixture was chilled on 
30 1ce and adjusted to 18 v l by the addition of dlthiothreltol to 10 mM. 
each dXTP to 0.5 mM and 5 units of DMA Polymerase I Klenow fragment. 
The reaction mixture was Incubated on ice for 20 minutes, followed by 
Incubation at room temperature for about one hour. 
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The reaction mixture was then used to transform coll 
strain DG98, plated onto agar plates and Incubated overnight to obtain 
phage plaques. Plaques were blotted using nitrocellulose filter 
circles arid the filters were treated as described to lyse the cells, 
5 denature the DNA, neutralize, rinse, fix the DNA to the filter and 
Incubate 1n prehybrldizatlon buffer. The 27-mer oligonucleotide was 
end labeled with 32 y phosphate using polynucleotide kinase and 
hybridized to the filters at 45°C overnight. Filters were washed and 
subjected to autoradiography. One of the probe positive plaques was 
10 designated pBpool. 

pBpool was grown 1n liquid culture and digested with Apal 
and BamH I endonucl eases under conditions specified by the 
manufacturer. The coding region of G-CSF has an Internal Apal site. 
This digestion generated an Apal -BamH I fragment of approximately 540 
15 base pairs, with the newly inverted BamH I end immediately 3' of the 
translation stop codon of the mG-CSF gene. 

Plasmid pPDl was also digested with Apal and BamH I under 
conditions suggested by the supplier. This digestion yielded an Apal- 
BamH I fragment in which the Apal site was the same as that described 
20 above and the BamH I site 1s 3* of the untranslated region of the G-CSF 
gene. 

The two digests were mixed at a molar excess of digested 
pBpool to pPDl and 11 gated under sticky end conditions using T4 DMA 
ligase. The ligation mixture was transformed into coll strain 
25 MM294. Transformants were screened for amplcill In resistance. 
Several colonies were Isolated and digested with HindHI and BamHI, A 
colony having the correct length Hindi I I -BamHI Insert was designated 
pPD2. 

D. Placing mG-CSF Under Pj Control 

30 Plasmid pLAPHp is a plasmid having the P L promoter and gene 

N-RBS controlling the expression of the Aphg gene. The Aphe gene Is 
contained in a H1ndIII-BamHI fragment. The plasmid further contains 
the positive retroregul atory element 3' to the Aphs gene with a BamHI 
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site directly 5' of the PRE. Plasmld plAphs was made from plasmld 
pFC54.T (ATCC number 39.739). Plasmld pLAPHs 1s exactly the same as 
pFC54.T except that the latter plasmld has a Hlndlll-BamHI fragment 
encoding des-alanyl-IL-2 1n place of the Aphe encoding fragment. 
5 Thus, the following steps to place the expression of G-CSF under 
temperature sensitive P L promoter-gene N-RBS and PRE control can also 
" be carried out using pFC54.T. The Hlndlll-BamHI fragment of pPD2 
having the ATG Initiation codon was placed under P L promoter control 
and PRE Influence as follows: 
10 Plasmld pLAPHs was digested with Hlndlll and BamHI under 

conditions suggested by the supplier. Plasmld pPD2 was digested with 
the same endonucl eases. The digests were combined and 11 gated with T4 
ONA ligase with the pPD2:pLAPHB digests at a 10:1 molar ratio. 

After ligation, the ligation mixture was used to transform 
IS competent coli_ host strain XNDG95. Amplclllln resistant colonies 
were selected and plasmlds were isolated therefrom, digested with 
BamHI and Hindi II and a transformant having the expected insert of 
approximately 540 bp was designated pJDl. 

EXAMPLE III 
20 Expression of mG-CSF In dPD2 and pJDl 

A. Growth and Induction of pP D2 in Transformed E. coli Strains 
MH294 and pJDl In Transformea t. con m xn DS9t> 

Initial attempts to produce raS-CSF using the two different 
expression vectors were not successful. The first vector, pJOl, 

25 (Figure 5) consisted of the DMA encoding the mature mG-CSF protein 
under the control of the bacteriophage x P L promoter and the X gene N- 
RBS. The Bacillus thuringiensis positive retroregulatlng element 
(PRE) 1s immediately downstream of the mG-CSF coding region. In 
addition, the origin of replication, derived from ColEi, contained 

30 mutations that conferred a temperature-sensitive Cop" phenotype. l± 
con cells, strain DG95x (x N7H33 C1837 sus P80). harboring pJDl were 
grown at 30°C to an optical density of 0.3 at 600 nm. The P L promoter 
and plasmld replication were then Induced by shifting the temperature 
to 42'C for three hours. 
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The second vector pPD2, consisted of DNA encoding the mG-CSF 
protein under the control of the E. coli Trp promoter. The Bacillus 
thur1ng1ens1s positive retroregulatlng element was Immediately 
downstream of the mG-CSF coding region. £. coll cells, strain W4294, 
5 were grown overnight In M9 medium plus 100 pg/ml of Trp and 50 ug/ml 
of amptcilUn. In the morning the cells were washed twice with M9 
medium and the Trp promoter was then induced by setting up cultures at 
an optical density of 0.05 at 600 nm in M9 medium without Trp. The 
cells were grown at 37°C to an optical density of 0.8 at 600 nm. 

10 After induction of pJDl and pPD2 f extracts of Induced and 

uninduced cells were prepared by boiling the cells for five minutes in 
Laemmli loading buffer (Laemmli, Nature, 227:680-685 (1970)) which 
contains sodium dodecyl sulfate (SOS) and 2-mercaptoethanol (2ME). 
After electrophoresis through 12.5* polyacryl amide gels (PAGE), 

15 containing SDS and 2ME, and staining with Coomassle brilliant blue, an 
induced protein band of the molecular weight corresponding to mG-CSF 
could not be detected. 

B. Steady State mG-CSF mRNA Level After Induction of Cells 
Harboring pJDl and pPD2 — 

20 To determine the steacty level of G-CSF transcripts 1n cells 

harboring pJDl and pPD2, RNA was prepared as follows: 20 ml of 
induced or uninduced cells was mixed with 20 ml of ice cold phosphate 
buffered saline. The cells were pelleted by centrifugation at 4°C and 
resuspended in 3 ml of 0.15 M sucrose-20 mM sodium acetate. 0.3 ml of 

25 a 20 mg/ml lyso^yme solution was added and the mixture was incubated 
on 1ce. 0.15 ml of 201 SDS was added and the mixture was extracted 
with equal volumes of phenol : chloroform (1:1) until the interface was 
clear (about 4 times). After two ethanol precipitations, the nucleic 
acid mixture was resuspended 1n 100 pi of 20 mM Tris (pH 7.5)-10 mM 

30 MgCl 2 . Two microliters of a 1 mg/ml solution of RNase-free DNase I 
was added and the mixture was incubated at 37°C for 10 minutes. Five 
yl of 201 SDS was added and the samples were then phenol :chloroform 
(1:1) extracted twice and ethanol precipitated 2 times. 
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The Northern blotting analysis was performed as described 
using a modification of a published protocol (Manlatls et al., 
Molecular Cloning; A Laboratory Manual. Cold Spring Harbor, N.Y. 
(1982)). Briefly, Northern blots were prepared by the electrophoresis 
5 of RNA through If agarose gels containing 0.5 M formaldehyde followed 
by blotting onto nitrocellulose filters. After baking the filters for 
1.5 hours at 80*C, they were prehybrldized for one hour at 45°C in: 5 
x SSC, 0.2* polyvinylpyrrolidone, 0.2* Flcoll, 0.21 bovine serum 
albumin, 0.1% SOS, 50 mM sodium phosphate pH 7.0, and 100 M g/ml 

10 tRNA. The blots were hybridized for 16 hours at 45°C In a similar 
solution that also contained dextran sulfate (10*) and 10 6 cpm per ml 
of the mG-CSF specific oligonucleotide S'-CGCTGCGCCATCGCCCrGGATCTT-S 1 
that had been labeled with y 32 P ATP and polynucleotide kinase. The 
blots were washed In 3.2 M tetramethyl ammonium chloride for 15 minutes 

15 at 62°C. The bands corresponding to mG-CSF mRNA were then detected by 
autoradiography. 

Figure 10 shows that Induced cells harboring pJDl contained 
a mG-CSF mRNA of the expected size as well as smaller mG-CSF related 
mRNA while cells harboring pP02 did not contain detectable mG-CSF 
20 mRNA. 

EXAMPLE IV 

A. Construction of a P| Vector with Alternative mG-CSF Codons 

Two ollgodeoxyrlbonucleotides were chemically synthesized 
and purified and had the following sequences: A * 5'- 
25 AGCTTATGACACCATTAGGAC-3', 8 ■ 5 1 -TAATGGTGTCATA-3 1 . 

DNA of pJDl was digested with the restriction enzymes 
Hindlll and Apal and, after electrophoresis, the large fragment was 
isolated from a Tris-acetate, 0.5* low-melting-point agarose gel 
(Crease et al., Methods in Enzymoloov ; 101:78 (1983)). This fragment 
30 was mixed with 20 pmole of each of the above two oligonucleotides. 
This mixture was llgated with T4 DNA llgase, and used to transform 
competent E. coll cells of the strain DG95. The cells were plated on 
agar plates containing amp1c1111n and colonies containing recombinant 



10 



0256843 

37 

plasmlds (pJD4A and pJD4B) were Identified by colony hybridization 
(Manlatis et al. supra) using oligonucleotide A which had been labeled 
with y 32 P ATP and polynucleotide kinase. As shown 1n Figure 11, pJD4A 
and pJD4B, both encode the same protein as pJDl; however, codons 2, 3, 
and 4 of pJD4A and pJD4B are different from the ones used 1n pJDl. In 
addition, codon 5 In pJD4B also differs from pJDl. This difference 
between pJD4A and pJD4B results from the fact that the end of the 
oligomer duplex that was Hgated to the Apal site contained a single 
mismatch to the single-stranded end generated by the enzyme Apa l. 
This type of mismatch will not interfere with ligations (Hung and 
Wenslnk, Nucleic Acids Research . _12 .-1863-1874 (1984)); however, 
depending on the repair of this mismatch, some of the resulting clones 
will regenerate the Apal site and have a pJDl-Hke 5th codon and some 
will lose the Apal site and have a pJD4B-Hke 5th codon. The pJD4A 
15 and pJD4B constructions were verified by DNA sequence analysis. 

B. Expression of mG-CSF Protein by pJD4A and pJD4B 

L. £2li harboring either pJD4A or pJD4B were Induced and 
analyzed by SOS-PAGE as described for pJDl in Example III. Figure 12 
shows that an inducible protein, with a molecular weight corresponding 
to that of mG-CSF. Is present in E;_ con cells that harbor either 
PJD4A or pJD4B. 
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C * Construction of a P, Vector With Alternative Codons Enco ding »n 
Amino Methionine Pepti dase Processable Amino leminur oTTIS*^ 

Two oligonucleotides were chemically synthesized and 
25 purified and had the following sequences: 

C; 5 1 AGCTTATGCCATTAGGAC3 1 
0: 3' ATACGGTAAT 5' 

DNA of plasmld pJDl was digested with restriction enzymes 
Hlndlll and Apal and the large fragment was Isolated as described In 
30 IV.A. above. 20 pM each of oligonucleotides C and D were mixed with 
the large fragment, Hgated with T 4 DNA ligase under sticky end 
conditions, and the Hgated mixture was used to transform competent E^ 
con strain DG95. Cells were plated on agar containing ampicilHn and 
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Amp resistant colonies containing a recombinant plasmld were 
Identified by hybridization with oligonucleotide C, which had been 
labeled with y 32 P ATP using polynucleotide kinase. The plasmld 
obtained from the probe positive colonies was designated pJD6 and 
5 encoded a G-CSF species in which the thr x Is deleted, i.e., v^mG- 
CSF. The correct sequence was confirmed by dideoxy sequencing. 

D. Expression of mG-CSF Protein by pPD6 

E. coll DG95 harboring pPD6 was Induced and analysed as 

described for pJDl in Example III. 
10 Figure 15 Is a picture of an SDS-PAGE gel of proteins 

produced by E. coli harboring pPD6 and pJD5. The cell clearly shows 

that pPD6 produce an inducable protein with a slightly smaller 

molecular weight than MG-CSF produced by pJD5. 

The protein band produced by pPD6 was isolated and purified 
15 from a large scale induction of pPD6. The purified protein was 

sequenced using an automated amino acid sequencer and showed that the 

MH 2 - terminus of this protein 1s pro rather than met or thr. 

EXAMPLE V 

A. Construction of a Trp Expression Vector Containing Alternative 
20 mG-CSF Codons 

DNA from pJD4A was digested with the restriction enzymes 
Hindlll and BamHI. The small fragment, containing the coding region 
for mG-CSF, was isolated from Tris-acetate 0.5% low-melting-point 
agarose gel. DNA from pPD2 was digested with the restriction enzymes 

25 Hindi 1 1 and BamH I and the large fragment was isolated from a Tris- 
acetate 0.51 low-melt1ng-po1nt agarose gel. These two fragments were 
Ugated together with T4 DNA llgase and used to transform competent^ 
coTj of the strain MM294. An amplcllHn resistant colony containing 
the recombinant was picked and the construct was verified by DNA 

30 sequence analysis. The resulting pi asmid was designated pPD5. 
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B. Expression of mG-CSF In pPDS 

E. col 1 harboring pPD5 was Induced and analyzed by SDS-PAGE 
as described for pPD2 1n the previous example. Figure 13 shows that 
an Inducible protein, with a molecular weight corresponding to that of 
5 mG-CSF, was present 1n E. coll cells that harbor pPD5. 

C. Using Alternative mG-CSF Codons Increases the Steady St ate Level 
of mS-CSF mftNA 

The steady state level of mG-CSF mRMA 1n Induced cells 
harboring either pPD2 or pPD5 was compared by Northern analysis as 
10 described above. Figure 14 shows that Induced cells harboring pPD5 
contain a mG-CSF transcript of the expected size, while no mG-CSF 
transcripts were detected in induced cells harboring pPD2. 

The following plasmids have been deposited 1n the American 
Type Culture Collection, Rockvllle, HD USA: 



15 Plasmld Strain Date Accession # 

pJD4A E. C0l1 K12 DG95X 8/12/86 67,181 

pJD4B TT coTT K12 DG95x 8/12/86 67,183 

pPDSA I7 coTT K12 MM294 8/12/86 67,182 

pAWll T7coTTK12 DG95X 11/8/84 39,918 

20 P Trp3 ITcoTTMClOOO 12/18/84 39,946 

pFC54.t T7coTTDG95x 8/7/84 39,789 



These deposits were made under the provisions of the 
Budapest Treaty on the International Recognition of the Deposit of 
Microorganisms for the Purposes of Patent Procedure and the 

20 Regulations thereunder (Budapest Treaty). This assures maintenance of 
a viable culture for 30 years from date of deposit. The deposits will 
be made available by ATCC under the terms of the Budapest Treaty, and 
subject to an agreement between applicants and ATCC which assures 
permanent and unrestricted availability upon issuance of the pertinent 

25 U.S. patent. The Assignee herein agrees that If the culture on 
deposit die or be lost or destroyed when cultivated under suitable 
conditions, It will be promptly replaced upon notification with a 
viable speciment of the same culture. Availability of the deposits is 
not be be construed as a license to practice the invention in 
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contrav ntion of th rights granted under the 
authority of any g v rnment in accordance with its 

patent laws. 

The invention includes the use of 6-CSF as 

defined above or a mutein as defined above in 

preparing a medicament, e.g. in preparing a 

pharmaceutical or veterinary formulation. 

In another aspect the invention includes a 
pharmaceutical or veterinary formulation comprising 
G-CSF as defined above or a mutein as defined above 
formulated for pharmaceutical or veterinary use 
respectively, optionally the formulation being in 
unit dosage form and/or also comprising an acceptable 
diluent, carrier or excipient. 

The inventors have disclosed G-CSF and means 
for the efficient expression thereof. Variations 
within the scope of the invention may be made by 
those ordinarily skilled in the art without departing 
from the essence of the invention as claimed herein. 
The illustrative embodiments given hereinbefore are 
not limiting. 
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CLAIMS ; 

1. A modified DNA sequence encoding G-CSF, said G-CSF 
having the effect of stimulating the production of primarily 
granulocyte or granulocyte-macrophage colonies 1n colony forming assay 
5 using bone marrow cell progenitors of an appropritate species, wherein 
the 5' region thereof comprises deoxyrlboadenlne substituted for the 
deoxyriboguanine or deoxyribocytosine in the third position of at 
least one codon wherein said substitution does not change the amino 
acid sequence of G-CSF. 

10 2. The DNA sequence according to claim 1 wherein said 5 1 

region comprises a DNA sequence encoding the amino add sequence Thr 
Pro Leu Gly or Thr Pro Leu Gly Pro Ala Ser Ser Leu Pro. 

3. The DNA sequence according to claim 2 wherein at least 3 
codons have said substitution. 

15 4. The DNA sequence according to claims 1, 2 or 3 further 

comprising deoxyri bo thymidine substituted for deoxyribocytosine 1n the 
first position of at least one codon. 

5. A modified DNA sequence encoding mature G-CSF wherein 
the 5' region thereof comprises codons selected from the group 

20 consisting of ACA CCA TTA GGA, ACA CCA TTA, ATG ACA CCA TTA GGA and 
ATG ACA CCA TTA. 

6. A modified DNA sequence encoding a muteln of G-CSF, said 
mutein of G-CSF having the effect of stimulating the production of 
primarily granulocyte or granulocyte-macrophage colonies in colony 

25 forming assays using bone marrow cell progenitors of an appropriate 
species, wherein the muteln does not have an NH 2 -terminal methionine 
when produced from a recombinant host. 
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7. The modified DNA sequence encoding a muteln of G-CSF of 
claim 6 wherein the NH 2 - terminal end of said muteln encodes a sequence 
that 1s processed by met amino peptidase. 

8* The modified DNA sequence encoding a muteln of G-CSF of 
5 claim 7 wherein the DMA sequence encodes an amino terminal end 
selected from the group consisting of met-ala, met-gly, met-pro, met- 
al a-met, met-gly-met, met-ala-ser t met-ala-pro, met-pro- thr, and raet- 
pro-leu. 

9. An expression vector comprising the DNA sequence of 
10 claims 1 to 8 in operable linkage with a promoter and ribosome 

binding site. 

10. The expression vector of claim 9 in operable linkage 
with a promoter-ribosome binding site selected from the group 
consisting of P L promoter and ribosome binding sites operable 

15 therewith and Trp promoter-ribosome binding site. 

11. An E. coli host transformed with the expression vector 
of claim 9 or claim 10. 

12. A culture of E. coli transformed with the expression 
vector of claim 10 having a G-CSF content of at least about 3% of 

20 total cell protein to about 10% of total cell protein. 

13. Human recombinant G-CSF. 

14. An mRNA fraction isolated from human cells that encodes 
human G-CSF. 

15. The expression vector of claim 10 comprising pJD4 or 

25 pPD5. 
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16. A muteln of G-CSF, said muteln of G-CSF having the 
effect of stimulating the production of primarily granulocyte or 
granulocyte-macrophage colonies in colony forming assays using bone 
marrow cell progenitors of an appropriate species wherein said muteln 

5 does not have an NH 2 -terminal methionine. 

17. The mutein of claim 16 wherein the NH 2 -term1nal end of 
said mutein has an amino acid sequence that is processed by met amino 
peptidase in a recombinant host. 

18. The mutein of G-CSF of claim 17 wherein the amino 
10 terminal end is selected from the group consisting of met-ala, met- 

gly, met-pro, met-ala-met, met-gly-met, met-ala-ser, met-ala-pro, met- 
pro-thr and met-pro-leu. 

19. The muteln of G-CSF of claim 18 which yields Vj-mG-CSF 
when expressed in a recombinant host. 

15 20. A mutein of mG-CSF selected from the group consisting 

of alaj-mG-CSF, glyj-mG-CSF, pro 1 -mG-CSF, prOj-thr^mG-CSF, ala 1 -met 2 - 
mG-CSF, glyj-met^-mG-CSF, ala 1 -ser 2 -mG-CSF f alaj-pro^mG-CSF, proj- 
leu 2 -mG-CSF and Vj^mG-CSF. 

21. v^mG-CSF. 
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22. A method of producing a mod if i d DNA 

s quence encoding G-CSF, said G-CSF having the eff ct 
of stimulating the production of primarily 
granulocyte or granulocyte-macrophage colonies in 
colony forming assay using bone marrow cell 
progenitors of an appropriate species, which method 
comprises substituting deeoxyriboadenine at the 5' 
region of the sequence for the deoxyriboguanine or 
deoxyribocytosine in the third position of at least 
one codon wherein said substitution does not change 
the amino acid sequence of G-CSF. 

23. A method according to claim 22 and 
further defined by the specific feature(s) of any one 
or more of claims 2 to 4. 

24. A method of making a modified DNA 
sequence encoding mature G-CSF which includes 
selecting codons for the 5' region thereof from the 
group consisting of ACA CCA TTA GGA, ACA CCA TTA, ATG 
ACA CCA TTA GGA and ATG ACA CCA TTA. 

25. A method of producing a modified DNA 
sequence encoding a mutein of G-CSF, said mutein of 
G-CSF having the effect of stimulating the production 
of primarily granulocyte or granulocyte-macrophage 
colonies in colony forming assays using bone marrow 
cell progenitors of an appropriate species, wherein 
the mutein does not have an NH 2 -terminal methionine 
when produced from a recombinant host, comprising 
modifying the NH 2 -terminal sequence encoding 
portion of a DNA sequence encoding 6-CSF, for example 
by subjecting cDNA encoding native G-CSF to site 
specific mutagenesis to produce an NH 2 -terminal 

s quenc wh rein the NH 2 -t rminal m thi nine is 
susceptible t nzymatic cl avage or ligating an 
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oligonucleotide encoding such an NH 2 -terminal 
sequence to a specifically digested G-CSF DNA 
sequence. 

26. A method according to claim 25 and 
further defined by the specific feature of claim 7 or 
of claim 8* 

27. A process for preparing an expression 
vector which comprises forming an operable linkage 
between a promoter and ribosome binding site and a 
DNA sequence as defined in any one of claims 1 to 8, 
the promoter-ribosome binding site preferably being 
selected from the group consisting of P L promoter 
and ribosome binding sites operable therewith and Trp 
promoter-ribosome binding site. 

28. A method of making an E. coli cell 
capable of expressing G-CSF which comprises 
transforming an E. coli host with an expression 
vector as defined in claim 9, claim 10 or claim 15. 

29. A process for the preparation of 
recombinant G-CSF which comprises culturing an E. 
coli transformed host as defined in claim 11 or a 
culture as defined in claim 12 or E. coli cells made 
by the method of claim 28 so as to permit expression 
of the E. coli genome. 

30. A process according to claim 29 wherein 
the G-CSF prepared thereby is as defined in any ne 
of claims 16 to 21. 

31. A pharmaceutical or veterinary 
formulati n comprising G-CSF as defined in claim 13 
or a mutein as defined in any one of claims 16 t 21 
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formulated for pharmaceutical or v t rinary us 
respectively, optionally the formulation being in 
unit dosage form and/or also comprising an acceptable 
diluent, carrier or excipient. 

32. The use of G-CSF as defined in claim 13 
or a mutein as defined in any one of claims 16 to 21 
in preparing a medicament, e.g. in preparing a 
pharmaceutical or veterinary formulation as defined 
in claim 31. 
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FIG. 4 

1 AAAAAACAGCCCGGAGCCTGCAGCCCAGCCCCACCCAGACCC ATGGCTGGACCTGCnArr 

netAiaGiyProAlaThr 
-30 

6 1 CAGAGCCCCATGAAGCTGATGGCCCTGCAGCTGCTGCTGTGGCACAGTGCACTCTGGACA 
GlnSerProMetLysLeuMetAlaLeuGlnLeuLeuteuTrpHisSerAlaLenTrpThr 

121 GTGCAGGAAGCCACCCCCCTGGGCCCTGCO^GCTCCCTGCCCCAGAGCTTCCTGCTCAAG 
ValGlnGluAlaThrProLeuGlyProAlaSerSerLeuProGlnSerPheLeuLeuLys 
-11 10 

1 81 TGCTTAGAGCAAGTGAGGAAGATCCAGGGCGATGGCGCAGCgCTC CAGGAGAAGCTGTGT 

CysLeuGluGlnValArgLysIleGlnGlyAspGlyAlaAlaLeuGlnGluLysLeuCys 
20 " 30 

241 GCCACCTACAAGCTGTGCCACCCCGAGGAGCTGGTGCTGCTCGGACACTCTCTGGGCATC 
AlaThrTyrLysLeuCysHisProGluGluLeuValLeuLeuGlyHisSerLeuGlylle 

3 01 CCCTGGGCTCCCCTGAGCAGCTGCCCCAGCCAGGCCCTGCAGCTGGCAGGCTGCTTGAGC 
ProTrpAlaProLeuSerSerCysProSerGlnAlaLeuGlnLeuAlaGlyCysLeuSer 

361 CAACTCCATAGCGGCCTTTTCCTCTACCAGGGGCTCCTGCAGGCCCTGGAAGGGATCTCC 
GlnLeuHisSerGlyLeuPheLeuTyrGlnGlyLeuLeuGlnAlaLeuGluGlylleSer 
80 90 

421 CCCGAGTTGGGTCCCACCTTGGACACACTGCAGCTGGACGTCGCCGACTTTGCCACCACC 
ProGluLeuGlyProThrLeuAspThrLeuGlnLeuAspValAlaAspPheAlaThrThr 
100 , 110 

481 ATCTGGCAGCAGATGGAAGAACTGGGAATGGCCCCTGCCCTGCAGCCCACCCAGGGTGCC 
IleTrpGlnGlnMetGluGluLeuGlyMetAlaProAlaLeuGlnProThrGlnGlyAla 
120 130 

541 ATGCCGGCCTTCGCCTCTGCTTTCCAGCGCCGGGCAGGAGGGGTCCTAGTTGCCTCCCAT 
MetProAlaPheAlaSerAlaPheGlnArgArgAlaGlyGlyValLeuValAlaSerHis 
140 150 

601 CTGCAGAGCTTCCTGGAGGTGTCGTACCGCGTTCTACGCCACCTTGCCCAGCCCTGAGCC 
LeuGlnSerPheLeuGluValSerTyrArgValLeuArgHisLeuAlaGlnPro 
160 170 

661 AAGCCCTCCCCATCCCATGTATTTATCTCTATTTAATATTTATGTCTATTTAAGCCTCAT 
7 21 ATTTAAAGACAGGGAAGAGCAGAACGGAGCCCCAGGCCTCTGTGTCCTTCCCTGCATTTC 
781 TGAGTTTCATTCTCCTGCCTGTAGCAGTGAGAAAAAGCTCCTGTCCTCCCATCCCCTGGA 
841 CTGGGAGGTAGATAGGTAAATACCAAGTATTTATTACTATGACTGCTCCCCAGCCCTGGC 
901 TCTGCAATGGGCACTGGGATGAGCCGCTGTGAGCCCCTGGTCCTGAGGGTCCCCACCTGG 
961 GACCCTTGAGAGTATCAGGTCTCCCACGTGGGAGACAAGAAATCCCTGTTTAATATTT AA 
1021 ACAGCAGTGTTCCCCATCTGGGTCCTTGCACCCCTCACTCTGGCCTCAGCCGACTGCACA 
1081 GCGGCCCCTGCATCCCCTTGGCTGTGAGGCCCCTGGACAAGCAGAGGTGGCCAGAGCTGG 
1141 GAGGCATGGCCCTGGGGTCCCACGAATTTGCTGGGGAATCTCGTTTTTCTTCTTAAGACT 
1201 TTTGGGACATGGTTTGACTCCCGAACATCACCGACGTGTCTCCTGTTTTTCTGGGTGGCC 
1261 TCGGGACACCTGCCCTGCCCCCACGAGGGTCAGGACTGTGACTCTTTTTAGGGCCAGGCA 
^GCCTGGACATTTGCCTTGCTGGACGGGGACTGGGGATGTGGGAGGGAGCAGACAGGA 
13 81 GGAATCATGTCAGGCCTGTGTGTGAAAGGAAGCTCCACTGTCACCCTCCACCTCTTCACC 
1441 CCCCACTCACCAGTGTCCCCTCCACTGTCACATTGTAACTGAACTTCAGGATAATAAAGT 
1501 GTTTGCCTCCAAAAAAAAAAAAAAA 
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1 2 3 4 5 

MET THR PRO LEU GLY 

G-CSF CODONS ATG ACC CCC CTG GGC 
pJD4B, pPD5 ATG ACA CCA TTA GGA 
pJD4A ATG ACA CCA TTA GGC 

FIG. II 
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